Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Extraction of Logical Structure from Articles in Mathematics

Identifieur interne : 001534 ( Main/Exploration ); précédent : 001533; suivant : 001535

Extraction of Logical Structure from Articles in Mathematics

Auteurs : Koji Nakagawa [Japon] ; Akihiro Nomura [Japon] ; Masakazu Suzuki (mathématicien) [Japon]

Source :

RBID : ISTEX:1E3DF0D2C722EA10F79418F82CCB9CF41C7E8BFB

Descripteurs français

English descriptors

Abstract

Abstract: We propose a mathematical knowledge browser which helps people to read mathematical documents. By the browser printed mathematical documents can be scanned and recognized by OCR (Optical Character Recognition). Then the meta-information (e.g. title, author) and the logical structure (e.g. section, theorem) of the documents are automatically extracted. The purpose of this paper is to show the extraction method of logical structure specialized for mathematical documents. We implemented this method in INFTY which is an integrated OCR system for mathematical documents. In order to show the effectiveness of the method we made a correct database from an existing mathematical OCR database, and made an experiment.

Url:
DOI: 10.1007/978-3-540-27818-4_20


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Extraction of Logical Structure from Articles in Mathematics</title>
<author>
<name sortKey="Nakagawa, Koji" sort="Nakagawa, Koji" uniqKey="Nakagawa K" first="Koji" last="Nakagawa">Koji Nakagawa</name>
</author>
<author>
<name sortKey="Nomura, Akihiro" sort="Nomura, Akihiro" uniqKey="Nomura A" first="Akihiro" last="Nomura">Akihiro Nomura</name>
</author>
<author>
<name sortKey="Suzuki, Masakazu" sort="Suzuki, Masakazu" uniqKey="Suzuki M" first="Masakazu" last="Suzuki">Masakazu Suzuki (mathématicien)</name>
<affiliation>
<country>Japon</country>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university" n="3">Université de Kyūshū</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:1E3DF0D2C722EA10F79418F82CCB9CF41C7E8BFB</idno>
<date when="2004" year="2004">2004</date>
<idno type="doi">10.1007/978-3-540-27818-4_20</idno>
<idno type="url">https://api.istex.fr/document/1E3DF0D2C722EA10F79418F82CCB9CF41C7E8BFB/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000207</idno>
<idno type="wicri:Area/Istex/Curation">000204</idno>
<idno type="wicri:Area/Istex/Checkpoint">000D69</idno>
<idno type="wicri:doubleKey">0302-9743:2004:Nakagawa K:extraction:of:logical</idno>
<idno type="wicri:Area/Main/Merge">001585</idno>
<idno type="wicri:source">INIST</idno>
<idno type="RBID">Pascal:04-0548773</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000507</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000282</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000486</idno>
<idno type="wicri:doubleKey">0302-9743:2004:Nakagawa K:extraction:of:logical</idno>
<idno type="wicri:Area/Main/Merge">001697</idno>
<idno type="wicri:Area/Main/Curation">001534</idno>
<idno type="wicri:Area/Main/Exploration">001534</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Extraction of Logical Structure from Articles in Mathematics</title>
<author>
<name sortKey="Nakagawa, Koji" sort="Nakagawa, Koji" uniqKey="Nakagawa K" first="Koji" last="Nakagawa">Koji Nakagawa</name>
<affiliation wicri:level="4">
<country xml:lang="fr">Japon</country>
<wicri:regionArea>Faculty of Mathematics, Kyushu University, Kyushu Univ. 36, 812-8581, Fukuoka</wicri:regionArea>
<orgName type="university">Université de Kyūshū</orgName>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Japon</country>
</affiliation>
</author>
<author>
<name sortKey="Nomura, Akihiro" sort="Nomura, Akihiro" uniqKey="Nomura A" first="Akihiro" last="Nomura">Akihiro Nomura</name>
<affiliation wicri:level="4">
<country>Japon</country>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author>
<name sortKey="Suzuki, Masakazu" sort="Suzuki, Masakazu" uniqKey="Suzuki M" first="Masakazu" last="Suzuki">Masakazu Suzuki (mathématicien)</name>
<affiliation wicri:level="4">
<country xml:lang="fr">Japon</country>
<wicri:regionArea>Faculty of Mathematics, Kyushu University, Kyushu Univ. 36, 812-8581, Fukuoka</wicri:regionArea>
<orgName type="university">Université de Kyūshū</orgName>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university" n="3">Université de Kyūshū</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Japon</country>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university" n="3">Université de Kyūshū</orgName>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2004</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">1E3DF0D2C722EA10F79418F82CCB9CF41C7E8BFB</idno>
<idno type="DOI">10.1007/978-3-540-27818-4_20</idno>
<idno type="ChapterID">20</idno>
<idno type="ChapterID">Chap20</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Character recognition</term>
<term>Database</term>
<term>Document structure</term>
<term>Information browsing</term>
<term>Knowledge engineering</term>
<term>Mathematics</term>
<term>Optical character recognition</term>
<term>Printed character</term>
<term>Printed document</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Base donnée</term>
<term>Caractère imprimé</term>
<term>Document imprimé</term>
<term>Ingénierie connaissances</term>
<term>Mathématiques</term>
<term>Navigation information</term>
<term>Reconnaissance caractère</term>
<term>Reconnaissance optique caractère</term>
<term>Structure document</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Base de données</term>
<term>Mathématiques</term>
</keywords>
</textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: We propose a mathematical knowledge browser which helps people to read mathematical documents. By the browser printed mathematical documents can be scanned and recognized by OCR (Optical Character Recognition). Then the meta-information (e.g. title, author) and the logical structure (e.g. section, theorem) of the documents are automatically extracted. The purpose of this paper is to show the extraction method of logical structure specialized for mathematical documents. We implemented this method in INFTY which is an integrated OCR system for mathematical documents. In order to show the effectiveness of the method we made a correct database from an existing mathematical OCR database, and made an experiment.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Japon</li>
</country>
<region>
<li>Kyūshū</li>
<li>Préfecture de Fukuoka</li>
</region>
<settlement>
<li>Fukuoka</li>
</settlement>
<orgName>
<li>Université de Kyūshū</li>
</orgName>
</list>
<tree>
<country name="Japon">
<region name="Kyūshū">
<name sortKey="Nakagawa, Koji" sort="Nakagawa, Koji" uniqKey="Nakagawa K" first="Koji" last="Nakagawa">Koji Nakagawa</name>
</region>
<name sortKey="Nakagawa, Koji" sort="Nakagawa, Koji" uniqKey="Nakagawa K" first="Koji" last="Nakagawa">Koji Nakagawa</name>
<name sortKey="Nomura, Akihiro" sort="Nomura, Akihiro" uniqKey="Nomura A" first="Akihiro" last="Nomura">Akihiro Nomura</name>
<name sortKey="Suzuki, Masakazu" sort="Suzuki, Masakazu" uniqKey="Suzuki M" first="Masakazu" last="Suzuki">Masakazu Suzuki (mathématicien)</name>
<name sortKey="Suzuki, Masakazu" sort="Suzuki, Masakazu" uniqKey="Suzuki M" first="Masakazu" last="Suzuki">Masakazu Suzuki (mathématicien)</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001534 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001534 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:1E3DF0D2C722EA10F79418F82CCB9CF41C7E8BFB
   |texte=   Extraction of Logical Structure from Articles in Mathematics
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024